IMPROVEMENTS IN AND RELATING TO IDENTIFICATION 



This invention concerns improvements in and relating to identification, 
particularly in die field of forensic science, and particular but not exclusively relating to 
identification techniques based on tbe use of single nucleotide polymorphisms. 

In a wide variety of situations it is desirable to be able to obtain information about 
the contributors to a mixture of DNA. Existing techniques are either limited in terms of 
the range of known factors which must be available for meaningful results to be obtained 
and / or limited in terms of the concentration of DNA which must be available from each 
of the sources to be considered and / or die relative proportions of the DNA sources 
contributing to the mixture, 

The present invention aims to provide a technique which is more versatile in 
terms of the type of situation which can meaningfully be considered and / or be more 
useful in terms of the range of concentration which can be usefully considered and / or be 
more useful in terms of the proportions of the DNA contributed to a mixture by more than 
one contributor which can be considered. 

According to a first aspect of the present invention we provide a method for 
indicating the likelihood that a DNA mixture arose from sources of a defined type where 
the DNA mixture is formed by DNA samples from more than one source, the method 
involving:* 

the determination of the identity of the alleles present at a locus for the DNA in 
the mixture; 

determining a first probability function for the situation where the DNA mixture 
is formed from samples arising from the given person and from a first other person; 

determining a second probability function for the situation where the DNA 
mixture is formed from samples arising from a second other person and a first other 
person; 

using the first probability function as numerator and the second probability 
function as denominator in determining a likelihood ratio for the mixture having arisen 
from the defined type of sources considered in the first probability function; 

determining such likelihood ratios for a plurality of loci; and 
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combining the likelihood ratios to give a combined likelihood ratio for the 
mixture having arisen from the defined type of sources considered in the first probability 
function. 

The defined type may assign an origin to one or both of the sources contributing 
to the mixture. The given person may be a suspect or other known person under 
investigation. 

The first other person, particularly where the mixture is being considered as 
potentially arising from a suspect and an unknown person, may be an unknown person. 
The second other person in such cases may also be an unknown person. 

The first other person, particularly where the mixture is being considered as 
potentially arising from a suspect and a victim, may be a known person, such as the victim. 
The second other person in such cases may be an unknown person, particularly neither the 
suspect or victim. 

Preferably the mixture arises from only two sources. 

The identity of the alleles may be determined using techniques for identifying 
single nucleotide polymorphisms. 

Preferably the first probability function is the probability that the defined type 
provides one or both of the mixture sources, ideally based on the frequency of occurrence 
of the possible allele combinations which could generate the identified allele identity or 
identities for that locus. The identity of the alleles at a locus, from the two sources, may 
be the same or different 

In a first embodiment of the method, where the defined type is the given person 
and an unknown person, the first function may be based on the frequency of occurrence of 
the different possible allele combinations for the unknown person which are possible 
knowing the given persons alleles at that locus. The first function, preferably the 
numerator thereof may be any one or more of the numerator functions set out in Figure 1. 
The first function, preferably the numerator thereof, may be the numerator function set out 
in Figure 1 for the given respective allele identity set out in Figure I, 

In a second embodiment of the method, where the defined type is the given 
person and the first other person is a known person, the first function may be defined as 1 . 

Preferably the second probability function is the probability that the first and 
second other persons provide the identity for the mixture sources, ideally based on the 




frequency of occurrence of possible allele combinations which could have generated the 
identified allele identity of identities for that locus. The identity of the alleles at a locus, 
from the two sources, may be the same or different 

In a first embodiment of the method, where the defined type is the given person 
and an unknown person, the second function may be based on the frequency of occurrence 
of the different allele combinations which ere possible from the two unknown persons 
which give the allele identity or identities obtained The second function, preferably the 
denominator thereof, may be any one or more of the denominator functions set out in 
Figure 1 . The second function, preferably the denominator thereof, may be the 
denominator function set out in Figure 1 for the given respective allele identity set out in 
Figure 1. 

In a second embodiment of the method, where the defined type is the given 
person and the first other person is a known person, the second function may be based on 
the frequency of occurrence of the different possible allele combinations for the unknown 
person which are possible knowing the known person's alleles at that locus. The second 
function, preferably the denominator thereof, may be any one or more of the denominator 
functions set out in Figure 3. The second function, preferably the denominator thereof, 
may be the denominator function set out in Figure 3 for the given respective allele 
identities set out in Figure 3. 

The method may be applied to two or more loci, but is preferably applied to at 
least 20 loci and still more preferably at least 30 loci. The method may be applied to SO or 
more, 100 or more, 150 or more or even 200 or more loci to increase the statistical 
significance ofthe results. 

The combined likelihood ratio may be obtained by multiplying the individual 
likelihood ratios together. 

To estimate the optimum number of loci used, preferably in an array, a theoretical 
likelihood ratio may be used, ideally calculated from:- 



mp 





where n is the number of loci; mpis the number of possible allele identities for a 
simple mixture; LR is the likelihood ratio; LR is the combined likelihood ratio; and fin is 
the proportion of an array of a loci having a particular mixture type m. 



The proportions of the loci (fin) having the specified identities (mixture type) may 
be as set out below > 



Mixture type (U,S) 


AAAA 


AAAB 


ABAA 


Frequency of 
observations (fa) 




2/a 3 /b 


2fi£fi> 




Mixture type (U,S) 


AABB 


BBAA 


ABAB 


Frequency of 
observations (fa) 


fotft? 


fa z fl>* 






Mixture type (U,S) 


ABBB 


BBAB 


BBBB 


Frequency of 
observations (fa) 


2fc/b J 


2/a/b* 





Where the allele identity or identities of a given person and/or known first other 
person are under consideration* the method may include the determination of the allele 
identity or identities at one or more of the loci under consideration from DNA obtained 
only from the given person or known first person. 

In an embodiment of the invention, particularly where the defined type is the 
given person and the first other person is a known person, such as a victim, it is preferred 
that at least some of the loci considered in the method are those in which the given person 
and first other person are known to differ in allele identity. The method may consider only 
loci at which the given person and known first person have alleles which are different 

In an embodiment of the invention, particularly where the defined type is the 
given person and the first other person is a known person, such as a victim, the method 
may consider loci at which the given person and known first person are known to have the 
same homozygous allele identity. Preferably in such cases the method includes the 
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establishment of a probability value that the other identity, for instance AA or BB, is 
absent The probability value may involve an investigation of the background noise level 
from the allele identity investigating process, for instance a PCR based amplification 
process. The investigation may involve the introduction of one or more negative control 
samples. The investigation may involve the determination of a cumulative probability 
density function for one or more or all of the negative controls. This function may be used 
to establish the level and/or proportion of DNA in the mixture which would have given 
detection of the identity being established as absent The level and/or proportion may be 
compared with other information thereon. 

In an embodiment of the invention, particularly where the defined type is the 
given person and the first other person is a known person, such as a victim, the method 
may involve the establishment of a probability value that the given person's allele identity 
or identities has not been detected. In one instance, the probability value may relate to the 
given person's allele identity being different from that of the known first other person's. 
The known first other person's allele identity may be AA or BB, where A and B designate 
the two possible allele identities at that SNF. In such cases, the given person's allele 
identities accounted for may be BB, B A, AB where the first other person's identity is AA 
and/or the given person's allele identities accounted for may be AA, BA, AB where the 
first other person's identity is BB. The probability value may be accounted for by the 
equation:- 



LR = 



p (B * null) 

[lab + b 2 ]p{B t null) + a 2 jiB = null) 



where a and b are allele frequencies of A and B respectively. 
In a second instance, the probability value may relate to the given person's allele 
identity being the same as that of the known first other person's. The given person and the 
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known first other person's allele identity may be AA or BB, where A and B designate the 
two possible allele identities at that SNP, In such cases, the possibility that the mixture 
was formed by a sample from a second other person, rather than the given person, may be 
accounted for. In such cases, the second other person's allele identities accounted for may 
be BB, BA, AB where the given person and first other person's identity is AA and/or the 
second other person's allele identities accounted for may be AA, BA, AB where the given 
person and the first other person's identity is BB. The probability value may be accounted 
for by the equation :- 



LR p(b = null) 



[lab + b 2 ]p(B * null) + a 2 p(B = null) 



where a and b are allele frequencies of A and B respectively. 

The method may further include the prediction of the proportion of the mixture 
arising from the person other than the first other person, for instance from the suspect as 
the given person. The method may include an estimate or calculation of a value for 
p(null). The value for p(null) may be calculated from a cumulative probability density 
function. The calculation may be derived from experimental data obtained by probing 
negative controls with respect to one or more allele identities. 

Various embodiment of the invention will now be described, by way of example 
only, and with reference to the accompanying drawings in which:- 

Figure 1 illustrates in tabular form Che likelihood ratio numerator and 
denominator for each of the nine possible phenotype combinations when a 
mixture is under consideration as coming from an unknown individual and a 
suspect, with the proportion of mixture phenotypes expected (fin) also provided; 
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Figurc 2 is a plot of combined likelihood ratios for arrays tanging from 
SO to 200 loci when there is a mixture under consideration as coining from a 
suspect and an unknown individual; 

Figure 3 illustrates in tabular form the likelihood ratio denominator for 
each of the nine possible phenotype combinations when a mixture is under 
consideration as coming from a victim and a suspect; 

Figure 4 is a plot of combined likelihood ratios for arrays ranging from 
SO to ISO loci where there is a mixture under consideration as coming from a 
suspect and a victim; 

Figure 5 illustrates in tabular form DNA profile results for the victim and 
suspect in the first two columns with the multiplex test being specifically directed 
towards the alleles that are not found in the victim, as shown in the difference 
column with results column showing those expected if the suspect is the 
perpetrator (there are at least two possible genotypes to consider in the likelihood 
ratio where frequencies have then been incorporated into the calculation); 

Figure 6 exemplifies the results from four mitochondrial loci multiplex 
together using the universal primer approach, with sample 1 designated 
247G;195T;152T;146T; sample 2 designated 247G;195C;152C;146C; and sample 
3 is designed 247G;195T;152C;146C; 

Figure 7 illustrates a demonstration of amplification specificity and 
sensitivity of detection of mitochondrial DNA with the left hand series of figures 
showing detection of the major component of the mixture coding for the rot0073G 
polymorphism and the right hand series showing detection of the minor 
component of the mixture, coding for the mt0073A polymorphism, established in 
a the sensitivity of the test at Circa 12.5pg genomic DNA; 

Figure 8 illustrates results where primer 416A is tested against GclS- 
lS;lf-lf;2-2 and negative control; 

Figure 9 illustrates singleplex reactions to test specifity of the forward 
primer 420G, which detects both the Gel polymorphisms, with the primer tested 
against a series of individuals -GclS-iS;lf-lf;2-2; and a negative control; 



Figure 10 illustrates results fiom a singleplex reaction to test sped&ty of 
forward primer 420T which detects Gc2 with the samples being fiom lS-lS;lf- 
lf;2-2; and negative control; and 

Figure 1 1 demonstrates the limits of detection using a dilution series of a 
1S-1S individual with samples prepared at lng, 200pg, 400pg and 800pg 
respectively. 

Analysis of multi-source samples 

For any given single nucleotide polymorphism locus there can only be two 
different alleles. In the following discussion these will be designated A and B. 

Assuming that a DNA sample under consideration is a mixture with two 
contributors, if analysis of the mixture reveals just one allele appearing at the locus then 
both the contributors to the mixture must be homozygous for the same allele (AA; AA or 
BB;BB depending on the one allele determined). 

If two alleles are visible in die experimental results then a considerable number of 
possibilities for the genotype combinations apply. Where two contributors are involved, 
the possible combinations are: AA,AB; AAfiB; AB3B; AB,AB; and all of the reverse 
possibilities too. In total, nine possible genotype combinations exist for a two contributor 
mixture when both alleles are detected for a given locus. 

This basic situation can be extended to relate to a two source mixture involving a 
suspect and an unknown individual and to a situation involving a two source mixture 
where a victim and a suspect are under consideration. Such situations where one of the 
contributors constitutes only a relatively minor part of the mixture can also be investigated 
using (he technique set out in more detail below. 

Contributors to the Mixture are Suspect and an Unknown Individual 

For example* suppose that a blood stain is retrieved from a crime scene and the 
phenotypes are consistent with the combination of a suspect and an unknown individual. 
Two possible situations exist for which a likelihood ratio can be considered, first situation 
"C" where the contributors were the suspect and the unknown individual and secondly 
situation "not C" where the contributors are two unknown individuals. 




For any given locus under consideration, the calculation of the likelihood ration 
will depend upon the phenotype of the suspect and the alleles actually observed in the 
mixture. Three broad categories exist in this regard:- 

Category 1 - where the suspect is homozygous (AA) and the profile 
shows just one allele and as a consequence the unknown must be AA also, thereby giving 
theLR=l/fe 2 . 

Category 2 - where the suspect is homozygous (AA) and the mixture is 
AB and as a consequence the unknown must either by AB or BA or BB. In this case the 
probability of situation C - fb 2 + 2fe fb and the likelihood of situation not C = 6fe 2 fb? + 
4fa' ft + 4£afb J thereby giving a likelihood ratio = (2fefb + fbtytftftf 1 46?fb + 4fefb 3 ). 

For Category 3 - the suspect is heterozygous (AB) and the profile is AB 
and as a consequence the unknown must be AA, AB, B A or BB. In this case the 
probability of situation C = (fa +fb) 2 and the probability of situation not C is the same as 
for the Category 2 case thereby giving a likelihood ratio « (fa + fbfl(6fsPf& + 4fa 3 fb + 
4fafb 3 ). 

A complete list of the numerators and denominators for the likelihood ratios for 
the nine possible genotype combinations (m = 1 to 9) are set out in the table of Figure 1 , 

If an array of n different loci are considered, the proportion of an array of n loci 
having a particular mixture type is fin; and if for each locus there are mp = 9 possible 
mixture phenotype combinations the combined likelihood ratio for the n loci is 



mp 

UL = H LR (finxn) 

m=l 




As an illustration of how the likelihood ratio plots vaiy for arrays involving 50 to 
200 different loci (in die case of a mixture with a suspect and an unknown individual) 
reference is made to the plots set out in Figure 2. In this illustration arrays of SO, 100, 150 
and 200 loci were considered. The plot refers to the combined likelihood ratio for the n 
loci with the simplifying assumption that the allele frequency, f(a), for each locus is the 
same across all the loci. The combined likelihood ratio maximises when the frequency of 
allele a is high (0.8) or low (0.2). A battery of 50 loci with frequencies of alleles ranging 
between 0.1 and 0.9 will give a minimum LR of 10 4 and a battery of 200 loci will give a 
minimum LR of 1 0 U , thereby indicating significant statistical power.. 

Contributions to the Mixture are Victim and Suspect 

In many situations, such as a typical rape forensic investigation, the mixture 
comprises contributions from both the victim and a suspect, thus there are two potential 
situations to be considered, situation "C" where die contributors to the mixture are the 
suspect and the victim, and situation "not C 1 where the contributors are the victim are an 
unknown individual. 

Once again considering a mixture profile, with two alleles indicated, a number of 
potential positions arise. 

Firstly, if the profile comprises two alleles (AB) and the victim is known to be 
AB then the suspect may be AA, AB, B A or BB. The probability for situation C is thus *= 
1. The probability for situation not C~(£a + fb) J , This therefore gives a likelihood ratio = 
l/(fa+fb) 2 . 

Secondly, if the profile comprises two alleles (AB) and the victim is homozygous 
(AA) then the suspect is either AB, BA or BB. In this case the probability of situation C « 
1 once again, and the probability for situation not C - 2fefb + fb*. The likelihood ratio, 
therefore » l/(2fafb + fb 2 ). 

If die profile shows a single allele and both the victim and suspect are 
homozygote, (AAfAA), as a consequence die likelihood ratio » 1/A*. 

The table of Figure 3 illustrates the likelihood ratio denominator and frequency 
(fin) information for the potential nine genotype combinations. Figure 4 illustrates in 
graphic form the plots obtained for SO, 100 and 1 50 loci based analysis in such cases. 
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Analvsis of mixtures with minor contribution from one sample 

As well as offering the above mentioned general consideration of the likelihood 
of the DNA in a sample from a number of sources applying, the present invention also 
offers the possibility of successful analysis even where one of the parties was only a minor 
contributor to the mixture, less than 10% of the mixture. 

Techniques for analysing mixtures are known based around the use of a short 
tandem repeats (STR's) as described by Clayton et al (1998) Analysis and Interpretation of 
Mixed Forensic Stains Using DNA §TR Profiling. Int J. Forensic Sci. 91, 55-70. The 
analysis of minor components and mixtures using STR based techniques, however, is 
particularly problematical when the minor component is present at a very low level (less 
than 1 in 10). Below this level, allele indications from the minor sample are close to the 
background noise and are difficult to distinguish as a result 

The technique of this invention can be applied to address such situations and still 
obtain meaningful results, however, particularly where the amplification set out in more 
detail below is applied. The technique of the present invention also offers different 
information from the analysis in the event of particular allele combinations. 

In the technique, when a DNA sample is obtained which needs analysis and for 
which two contributors are suspected, then it is desirable to base the investigation of that 
sample on a method tailored to the DNA profile of the victim. When using SNP (single 
nucleotide polymorphism) based analysis, the most useful loci are those which are 
homozygous in the victim (where the victim is either AA or BB) as only then can detection 
of the other possible allele in the mixture imply information about the other contributor to 
the sample, possibly the perpetrator of a crime. 

If the victim is heterozygous at the locus in question, then less useful information 
can be obtained since both alleles contributed by the victim will mask any alleles 
contributed by the perpetrator. Even so, where both the suspect and victim are the same 
homozygote ( AA for instance) a new type of information can be provided In such cases 
the allele B will be absent from the mixture and this can be confirmed using (he present 
technique as the background level in the analysis is negligible, thus removing any 
argument that a BB contribution was present but was too small to detect 

Figure 5 illustrates potential results for 7 different loci, including the victim allele 
identities, suspect allele identities, potential allele identities for a perpetrator and the 




likelihood ratio that the suspect was a contributor to the mixture under consideration. In 
this instance, therefore, only loci 1, 3, 5, 6 and 7 are potentially informative and worthy of 
analysis, this can be determined from an initial profile of the suspect* s and victim's DNA 
from a clean sample obtained from those persons. 

flighlv speg f fc ampli fication technique 

Whilst the technique described in this application is applicable to all such analysis 
techniques, it offers particular advantages in providing information where the background 
noise from experimentally obtained data is minimised. In this regard reference is made to 
the technique described in the common applicants patent application number OB 
9917307,2 filed 23 July 1999 which describes a highly specific amplification technique 
which minimised background noise as a result. The contents of that application are fully 
incorporated herein by reference, particularly for the purposes of providing such a highly 
specific amplification technique. 

Analysis of mixtures with very minor contributions from one sample 

Even whh the substantial reduction of background noise potential problems 
remain where one party's contribution is very much smaller than the other, less than 1 in 
20 or even potentially down to situations where the contribution is less than 1 in 100. 
There are also potential problems where the mass of the mixture available for analysis is 
small Qess than 25pg). In both such cases there is a problem in that the alleles of the 
minor contributing party may not report to the results in a detectable way. 

For STR's (short tandem repeats) even though the proportion of the mixture 
contributed by individual X relative to individual Y is similar between the different loci 
within the mixture, if the proportion of one party's contribution to the mixture is much 
lower than the other then the lower proportion allele is not necessarily observed in the 
results. This is a particular problem with STR's as the means of identification in that case 
depends on two pieces of information:- a), the mobility of allele in the electrophoretic gel 
and b) the relative concentration of intensity of the band which is used to assign the band 
to cither the major or minor contributor of the mixture. 

SNPs offer a considerable advantage in this area as the assay is purely 
quantitative, no mobility information need be obtained. The limits of detection, therefore 
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are entirely dependent upon the levels of background noise inherent in the assay, as well as 
minimising the noise effect, the present invention offers the chance to provide further 
statistically relevant information by accounting for such potential non-reporting in its 
theory. 

In the following explanation, cases where one parties contribution, a suspect for 
example, to the mixture is potentially very minor are considered and where, as a result, the 
allelic signal from that party's contribution is so close to the background noise threshold 
that it is difficult to distinguish from the noise. 

Further assuming that the suspect «* BB and the victim - AA then the profile 
should illustrate both A and B alleles, However, as previously stated if the proportion of 
the mixture contributed from suspect is very low or if the amount of DNA contributed by 
(he suspect is very low then the B allele might not be detected in the results (potentially 
because it is swamped by the background noise of the system used)* In such cases we 
need to interpret the information based on this potential non-observance (B is not null, it is 
present but not distinguished from the background noise) in relation to situation "C", 
where die contributors are Ac suspect and the victim, and the situation "not C" where the 
contributors are the victim and an unknown person who is not the suspect 

This can be expressed in the function set out below, where the numerator 
accounts for the alternative possible contributions from the suspect, as minor contributor. 
. If B is present in reality, and not null, then the phenotypes which might contribute are AB, 
BA or BB; alternatively if B is not present even in reality (B is null) because the 
contributor does not possess this allele, this leaves AA as the only possibility. The 
function, the likelihood ratio, is summarised as follows:- 



LR = 




[2ab+ b 2 ]p(B * null) + a 2 p(B = null) 
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Similarly, if the suspect is AA and die victim is AA, but the suspect contribution 
is very low, even if the profile only reveals A the possibility that the actual perpetrator is 
AB, BA or BB must be evaluated, This gives the function :- 



p{B = null) 

LR = 



[lab + b 2 ]p{B * null) + a 2 p(B = null) 



Given an estimate of die proportion of the minor contributor in the mixture, 
p(ruill) can be directly estimated from die cumulative probability distribution functions of 
the background controls for each locus. The lower the background signal is established to 
be, the lower p(null) value must be. Thus very greatly increased certainty can be 
expressed that the allele identity not reported in the results was due to it not being present 
in the samples which contribute to die mixture, rather than being there but not detected. 



Experimental illustrations of invention 

Tully et al (1996) described a mini-sequencing approach to analyse mitochondrial 
DNA SNPs. The SNFs listed in table 1 were analysed using the approach described above 
with universal G or universal C attached to the 5 ! end of the primers listed, The sizes of 
each DNA fragment are known - when run on a gel, bands which are either JOE (green) or 
FAM (blue) labelled are visualised 



Table 1 

Forward Primers 

Position Primer s tquenee Sequence Listing 3 P polymorphism 

nscdwfth Universale ID do. used with UnWeraJ G 

73 GTAlTTrCGTCTGGGGGGTA 1 G 

146 GTCTGTCTTTGATTCCTGCCC 2 T 

152 TTTGATTCCTGCCTCATCCC 3 T 

195 ATATTACAGOCGAACATACC 4 T 

247 GCTTOTAGGACATAATAATAACAATTA 5 G 



Reverse Primer 

326 CAGAGATGTGTTTAAGTGCTGT 



6 




NB. Universal primer C was dye labelled with FAM (blue) and Universal primer G was 
labelled with JOE (green). 

Reaction conditions: 

For each separate reaction: 

DNTPs were at a final concentration of 35mM 

Perkin Elmer (PE) buffer was at a final concentration of <X375mM with 0.375mM MgCl^ 

0.25 AmpliTaq (PE) was added to 50ul reaction. 

Primer concentrations are detailed separately with (he examples given. 

All phenotypes were verified by independent analysis using the mini-sequencing method 

ofTullyetal(1996), 

Example 1 : multiplexed mitochondrial DNA 
Reaction conditions: 
DNTPs all at 10mM; 

Final concentration of 35mM PE buffer 1 5mM 1 5raM MgCl2 per reaction MgC12 = 
.375mM. AmpliTaq = .25ul in 50ul 

In the following example, luM of each of die forward primers and 2uM of the 
reverse primer listed in table 1 was used in die reaction mixture. A 50ul reaction 
containing 0.3ng of genomic DNA was amplified through 8 cycles at 94C for 30sec; 57C 
for 30sec and 72C for 90sec. An aliquot of Sul of the reactant was then transferred into a 
second tube containing luM of each forward universal primer and luM of the reverse 
primer and luM of the reverse universal primer. This was amplified for 22cycles at 94C 
for 30sec» 62C for 30sec and 72C for 90sec. Samples were electrophoresed on a ABD 377 
automated sequencer with Rax 500 sizing standard The negative control was treated 
under the same conditions, except that no DNA was added to the reaction. 

Four mitochondrial loci were multiplexed together using this universal primer 
approach. The results are illustrated in Figure 6, where sample 1 is designated 247G; 
195T; 152T; 146T; sample 2 is 247G; 195C; 152C146C; sample 3 is designated 247G; 
19ST; 152C; 146C The regularly spaced small peaks are size standards. 
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Elucidation of a mixture where the minor component is <lOpgDNA (genomic equivalent 

In the next example, the results for which are illustrated in Figure 7, mixtures 
were prepared with flic major component coding for the mt0073G polymorphism (2ng 
genomic DNA) and the minor component coding for the mt00326 polymorphism (0-50pg). 
Amplified with forward primers, either ml0073-G or mt0073-A (luM) and the reverse 
primer mt00326 (luM), the cycling conditions were the same as described previously but 
the second round amplification was just 3 cycles. 

In the first experiment, left hand side results, the primers used were mt0073-G 
(luM) and mt00326 (luM) whereas in the other experiment, right hand side results v the 
primers were mt0073-A (luM) and mt00326 (luM). The results showed that even in the 
presence of very great excess of mt0073-G template, there was no mt0073-A background 
product detected Similarly using just primer mt0073A there was no mt0073-G detected. 
The high specificity of the reaction demonstrated discrimination of minor components in 
mixtures down to extremely low levels of I2,5pg in a total - a mixture ratio of 1:200. 

Example 3 

Genomic DNA - Group Specific Component ( Gc) 

The Gc single nucleotide polymorphisms have all been well characterised (Braun 
et al, 1992). In addition a large number of rare variants have been identified - (he test 
described here only detects the common alleles * 

Gc 2, GclF and GclS. Reynolds and Sensabaugh (1 990) compared cDNA sequences of 
Yang et al (1985) and Cooke and David (1985). Although polymorphisms were observed 
at 4 different sites, the most informative are at codons 416 and 420, where single base 
changes result in an amino acid change. At triplet 41 6, GATA codes for an aspartic acid 
residue in the Gc2 and GclF phenotypes, whereas GclS has a glutamic acid residue 
determined by codon CaG. Amino acid 420 is a lysine residue in the Gc2 phenotype 
coded by AAG; a threonine residue in botfi Gel phenotypes is coded by ACG. 

Four different forward primers were prepared to distinguish between the various 
polymorhphisms (table 2, 3). These primers were attached at die ? end to universal 
primers as described previously. 
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Table 2 






codon 


seqnence 


sequence listing ID no. 


forward primers 






420G/T 


ACCAGCTTTGCCAGTTCCR 


7 


416C/A 


TTCCGTGGGTGTGGCX 


8 


reverse primer 


GGCAGAGCGACTAAAAGCAAA 


9 



Sequence of primers used to detect GclF, OclS and Gc2 polymorphisms. R=G 
or T; X~C or A. 420T and 416A were attached to FAM labelled universal primer G; 420G 
and 416C were attached to JOE labelled universal primer C. 



Table 3 



416 




420 


G 


T 


A 


GclF 


Gc2 


C 


GclS 





The Gc phenotypes are dependent upon the codon mutations detected. Note that 
41 6A and 420T do not exist together in coupling. The 420G primer detects Gel 
phenotypes; 420T detects Gc2; 41 6C detects Gc2 or GclF (dependent on codon 420 
sequence); 416A detects GclS. 

Explanation of Examples: 

A series of examples are given (Figures 8 to 13). Two aspects were tested, 
specificity and sensitivity. To cany out specificity tests, a series of singleplex reactions 
were carried out. In Figure 8, primer 416A was shown to be a specific test for the GclF 
polymorphism by testing against Gc 1S-1S; 1F-1F; 2-2 and a negative control, with only 
the 1F-1F giving a single and no background being observed with the other samples. 
Similarly, primer 420G was specific for Gel polymorphisms (Figure 9) with only the first 
two samples giving a signal; and 420T was specific for Gc2 polymorphisms (Figure 10). 
The system was demonstrated to work with all primers in a cocktail mix (Figures 11,12 




and 13). Furthermore sensitivity of detection was c. 8pg genomic DNA (Figures 11 and 
13). A mixture was analysed (Figure 12); this demonstrated that mixtures are easily 
interpreted, the different molecular weights of FANf and JOE confer different molecular 
weights on the DNA fragments the same size, and this facilitates interpretation. It is easy 
to distinguish the artefact pall-up from a true allele (Figure 14). 

Reaction conditions 

The reagent concentrations were the same as described for mitochondrial DNA 
Primer concentrations used were 125nM for the locus specific forward primers and the 
reverse primer. The universal forward primers were at lOOnM, and the universal reverse 
primer at 288nM Locus specific and universal primers were admixed in a single tube 
reaction. The cycling conditions used were 94C for 30 sec; 61C for 30 sec; 72C for 90 sec 
for 35 cycles, followed by 72C for 10 rain. 

All phenotypes were verified by independent analysis using conventional 
isoelectric focussing. 



